Stable Classification of Text Genres
نویسندگان
چکیده
منابع مشابه
Squibs: Stable Classification of Text Genres
Every text has at least one topic and at least one genre. Evidence for a text’s topic and genre comes, in part, from its lexical and syntactic features—features used in both Automatic Topic Classification and Automatic Genre Classification (AGC). Because an ideal AGC system should be stable in the face of changes in topic distribution, we assess five previously published AGC methods with respec...
متن کاملThe Form is the Substance: Classification of Genres in Text
Categorization of text in IR has traditionally focused on topic. As use of the Internet and e−mail increases, categorization has become a key area of research as users demand methods of prioritizing documents. This work investigates text classification by format style, i.e. "genre", and demonstrates, by complementing topic classification, that it can significantly improve retrieval of informati...
متن کاملText genres in information organization
Introduction. Text genres used by so-called information organizers in the processes of information organization in information systems were explored in this research. Method. The research employed text genre socio-functional analysis. Five genre groups in information organization were distinguished. Every genre group used in information organization is described. Empirical evidence for genre gr...
متن کاملMachine Translation of Various Text Genres
Machine translation (MT) has been both praised and criticized since the 1930’s when it was first introduced. Today, MT − much improved since then, is a vital tool for the human translator, although not without its problems. One important unresolved issue is the success of MT for different text types. In the present study, we compare the performance of German-English machine translation in four ...
متن کاملClassifying movie genres by analyzing text reviews
This paper proposes a method for classifying movie genres by only looking at text reviews. The data used are from Large Movie Review Dataset v1.0 [4] and IMDb. This paper compared a K-nearest neighbors (KNN) model and a multilayer perceptron (MLP) that uses tf-idf as input features. The paper also discusses different evaluation metrics used when doing multi-label classification. For the data us...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computational Linguistics
سال: 2011
ISSN: 0891-2017,1530-9312
DOI: 10.1162/coli_a_00052